Adaptive Stack Algorithm in Document Image Decoding
نویسندگان
چکیده
The Stack algorithm, which is a best-first search algorithm widely used in speech recognition, is modified for application to the problem of recognizing machine printed text in the Document Image Decoding (DID) framework. An iterative scheme is described wherein successively more stringent Stack searches are performed, each time using a model of the image that is updated on the basis of what was discovered on the previous iteration. In this way, the algorithm can adapt to realistic degradation patterns that are irregular and perhaps not well described by stationary models. The contribution of this work is twofold: (1) it represents a reliable method of estimating suitable parameter values for Stack decoding in DID, and (2) as a means of handling nonstationary degradation, it presents an alternative to another recently developed approach that is described elsewhere, the Iterated Complete Path algorithm, at potentially lower computational cost. Preliminary results are presented on text line images with simulated nonstation-
منابع مشابه
A stack-based chaotic algorithm for encryption of colored images
In this paper, a new method is presented for encryption of colored images. This method is based on using stack data structure and chaos which make the image encryption algorithm more efficient and robust. In the proposed algorithm, a series of data whose range is between 0 and 3 is generated using chaotic logistic system. Then, the original image is divided into four subimages, and these four i...
متن کاملAdding linguistic constraints to document image decoding: comparing the iterated complete path and stack algorithms
Beginning with an observed document image and a model of how the image has been degraded, Document Image Decoding recognizes printed text by attempting to find a most probable path through a hypothesized Markov source. The incorporation of linguistic constraints, which are expressed by a sequential predictive probabilistic language model, can improve recognition accuracy significantly in the ca...
متن کاملA Novel Patch-Based Digital Signature
In this paper a new patch-based digital signature (DS) is proposed. The proposed approach similar to steganography methods hides the secure message in a host image. However, it uses a patch-based key to encode/decode the data like cryptography approaches. Both the host image and key patches are randomly initialized. The proposed approach consists of encoding and decoding algorithms. The encodin...
متن کاملDocument Analysis And Classification Based On Passing Window
In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
متن کاملStochastic attribute grammar model of document production and its use in document image decoding
Document Image Decoding (DID) refers to the process of document recognition within a communication theory framework. In this framework, a logical document structure is a message communicated by encoding the structure as an ideal image, transmitting the ideal image through a noisy channel, and decoding the degraded image into a logical structure as close to the original message as possible, on a...
متن کامل